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Abstract: We present an implementation of the next-to- leading order vector boson plus 
one jet production process in hadronic collision in the framework of POWHEG, which is 
a method to implement NLO calculations within a Shower Monte Carlo context. All 
spin correlations in the vector boson decay products have been taken into account. The 
process has been implemented in the framework of the POWHEG BOX, an automated computer 
code for turning a NLO calculation into a shower Monte Carlo program. We present 
phenomenological results for the case of the Z/7 plus one jet production process, obtained 
by matching the POWHEG calculation with the shower performed by PYTHIA, for the LHC, 
and we compare our results with available Tevatron data. 
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1. Introduction 

The process of vector boson production in association with jets plays an important role 
at present hadron colliders. In the early LHC phase, Z plus one jet production will play 
a major role in jet calibration. Furthermore, vector boson production in association with 
jets is an important background to new physics signals. In particular, when the Z decays 
into neutrinos, it becomes a source of missing-energy signals. The analogous process of 
W production, in association with jets, gives origin to an important background to the 
production of a lepton in association with missing energy and jets. 

Z plus one jet production has a very distinctive signature, when the Z decays into 
muons or electrons, and several studies have been performed at the Tevatron, both by the 
CDF [1] and by the DO Collaboration [2, 3, 4, 5]. They are all carried out by correcting 
the measured quantities to the particle level, according to the recommendations developed 
in the 2007 Les Houches workshop [6], and then compared to theoretical calculations, 
computed mostly at the next-to-leading order (NLO) level, and corrected for showering, 
hadronization and underlying-event effects. These corrections, in turn, are extracted from 
shower Monte Carlo (SMC) programs. 

In the present work, we present a calculation for the NLO cross section for vector 
boson plus one jet that can be interfaced to a shower Monte Carlo program, within the 
PDWHEG framework [7, 8]. This is the first time that such calculation has been performed. 
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More specifically, we have implemented this process using the POWHEG BOX, a general com- 
puter code framework for embedding NLO calculations in shower Monte Carlo programs 
according to the POWHEG method. In fact, the POWHEG BOX framework was developed using 
the Z/j + lj process as its first testing example. 1 

It is clear that, by using the POWHEG implementation of V + lj production, the com- 
parison of the theoretical prediction with the experimental results is eased considerably, 
and is also made more precise. Rather than estimating shower and underlying-event cor- 
rections using a parton shower program, one interfaces directly the parton shower to the 
hard process in question, yielding an output that can be compared to the experimental 
results at the particle level. We will carry out this task for the Z + lj case, and compare 
our results to the Tevatron findings. We remark, however, that a further improvement to 
this study could be carried out, by using the POWHEG program to generate the events that 
are fed through the detector simulation, and are directly compared to raw data. We are 
not, of course, in a position to perform such a task, that should instead be carried out by 
the experimental collaborations. 

The paper is organized as follows. In section 2 we give a brief description of our 
calculation. Since we used the POWHEG BOX to implement our process, we refer the reader 
to the POWHEG BOX publication [9], and report here only a few details that are particularly 
relevant to the process in question. In the following, we always consider the Z + lj case, 
since the W + lj is fully analogous. In section 3 we describe the generation of the Z + lj 
sample, together with some consistency checks of our calculation, and new features added 
to the POWHEG BOX. In section 4, we present results for the Tevatron and for the LHC at 
14 TeV. Comparison with available data from the CDF and DO Collaborations are carried 
out. Finally, in section 5, we give our final remarks. 

2. Description of calculation 

We have considered the hadroproduction of a single vector boson plus one jet at NLO 
order, with all spin correlations from the decay taken into account. All fermions (quarks 
and leptons) have been treated in the massless limit (no top-quark contributions have been 
taken into account). 

In order to implement a new process at NLO into the POWHEG BOX, we have to provide 
the following ingredients: 

i. The list of all flavour structures of the Born processes. 

ii. The list of all flavour structures of the real processes. 

iii. The Born phase space. 

iv. The real matrix elements squared for all relevant partonic processes. 

1 In the rest of the paper, for ease of notation, we will refer to the Z/y + lj process simply as "Z + lj 
production", without mentioning the presence of the photon, whose effects, together with all the spin 
correlations of the decay products, have been fully taken into account. 
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v. The finite part of the virtual corrections computed in dimensional regularization or in 
dimensional reduction. 



vi. The Born squared amplitudes B, the colour correlated ones Bij and spin correlated 
ones B^. 

vii. The Born colour structures in the limit of a large number of colours. 

For the case at hand, the list of processes is generated going through all possible 
massless quarks and gluons that are compatible with the production of the vector boson 
plus an extra parton. 

The Born phase space for this process poses no challenges: we generate the momen- 
tum of the vector boson distributed according to a Breit-Wigner function, plus one extra 
light particle. The vector boson momentum is then further decayed into two momenta, 
describing the final-state leptons. At this stage, the momentum fractions x\ and X2 are 
also generated and the momenta of the incoming partons are computed. 
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Figure 1: Sample graphs for the Born, virtual and real contributions to the Z/-f + lj production 
process. 

A sample of Feynman diagrams that contribute to the Z + lj process at the Born level 
(B) is depicted in panels (a) and (b) of fig. 1. Together with the Born diagrams, we have to 
consider the one-loop corrections to the tree level graphs, and the diagrams with an extra 
radiated parton. A sample of virtual and real contributions is depicted in panels (c)-(f) of 
fig. 1. 

We have computed the Born and real contributions ourselves, using the helicity- 
amplitude technique of refs. [10, 11]. The amplitudes are computed numerically in a 
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fortran code, as complex numbers, and squared at the end. The finite part of the vir- 
tual corrections has been taken from the MCFM program [12], that uses the virtual matrix 
elements given in ref. [13] and computed first in ref. [14]. 

The use of the helicity-amplitude technique to compute the amplitudes at the Born 
level facilitates the calculation of the spin-correlated matrix elements B^. In fact, they are 
the Born amplitudes just before being numerically contracted with the polarization vector 
of the initial- or final-state gluon. 

Since we are dealing with three coloured partons at the Born level, the colour correlated 
Born amplitudes Bij are all proportional to the Born one. The two independent colour- 
correlated matrix elements are given by 

B qq , = ±(2C F -C A )B, B qg = ^B, (2.1) 

and, from colour conservation, they satisfy 

Y,*H = C fj B, (2.2) 

where i runs over all coloured particles entering or exiting the process, and Cj j is the 
Casimir constant for the colour representation of particle j. 
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(a) (b) 

Figure 2: Born colour structures of the two Feynman diagrams illustrated in panels (a) and (b) 
in fig. 1. 

Finally, the Born colour structures, in the limit of a large number of colours, are 
straightforward, due to the presence of a single gluon and two quarks at the Born level. 
For the two Born diagrams depicted in fig. 1, the colour structures are shown in fig. 2. 

2.1 Generation cut and Born suppression factor 

The V + lj process differs substantially from all processes previously implemented in 
PDWHEG, in the fact that the Born contribution itself is collinear and infrared divergent. 
In all previous implementations, the Born diagrams were finite, and it was thus possible 
to generate an unweighted set of underlying Born configurations covering the whole phase 
space. In the present case, this is not possible, since they would all populate the very low 
transverse momentum region. Of course, this problem is also present in standard shower 
Monte Carlo programs, where it is dealt with by generating the Born configuration with a 
cut k gen on the transverse momentum of the V boson. After the shower, one must discard 
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all events that fail some transverse momentum analysis cut fc an in order to get a realistic 
sample. The analysis cut k an may be applied to the transverse momentum of the vector 
boson, or to the hardest jet. We assume here, for sake of discussion, that it is applied to 
the vector boson transverse momentum. 

Taking A; an > k gcn is not enough to get a realistic sample. In fact, in an event generated 
at the Born level with a given k T < k gen , the shower may increase the transverse momentum 
of the jet so that the final vector-boson transverse momentum k\ can be bigger than k mi . 
Thus, even if the generation cut is below the analysis cut, it may reduce the number of 
events that pass the analysis cut. Of course, as we lower k gen keeping /c an fixed, we will 
reach a point where very few events below /c gen will pass the analysis cut fc an . In fact, 
generation of radiation with transverse momentum larger than k gen is strongly suppressed 
in POWHEG, and, in turn, radiation from subsequent shower is required to be not harder 
than the hardest radiation of POWHEG. Thus, if we want to generate a sample with a given 
fc an cut, we should choose fc gen small enough, so that the final sample remains substantially 
the same if k gen is lowered even further. 

Together with the option of generating events with a generation cut k gcn , a second 
option for the implementation of processes with a divergent Born contribution is also 
available. It requires that we generate weighted events, rather than unweighted ones. This 
is done by using a suppressed cross section for the generation of the underlying Born 
configurations 

B supp = Bx F(k T ) , (2.3) 

where B is the inclusive NLO cross section at fixed underlying Born variables, k T is the 
transverse momentum of the vector boson in the underlying Born configuration and F(k T ) 
is a function that goes to zero in the k T — > limit fast enough to make B x F(k T ) finite in 
this limit. In this way, -B supp is integrable, and one can use it to generate underlying Born 
configurations according to its value. The generated events, however, should be given a 
weight l/F(k T ), rather than 1, in order to compensate for the initial F{k T ) suppression 
factor. With this method, events do not concentrate in the low k T region, although their 
weight, in the low k T region, becomes divergent. After shower, if one imposes the analysis 
cut, one gets a finite cross section, since it is unlikely that events with small transverse 
momentum at the Born level may pass the cut after shower. 

In recent POWHEG BOX revisions, both methods can be implemented at the same time. 
We wanted in fact to be able to implement the following three options: 

• Generate events using a transverse momentum generation cut A; ge n- 

• Generate events using a Born suppression factor, and a small transverse momentum 
cut, just enough to avoid unphysical values of the strong coupling constant and of 
the factorization scale that appears in the parton distribution functions. In this case, 
since we are also generating events with very small transverse momenta, radiative 
corrections may become larger than the Born term, and negative- weight events could 
be generated. We should then be prepared to track them. This issue will be further 
discussed in the next section. 
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• Apply a Born suppression factor, and set the transverse momentum cut k gCQ to zero. 
In this case the program cannot be used to generate events. It can be used, how- 
ever, to produce NLO fixed order distributions, provided the renormalization and 
factorization scales are set in such a way that they remain large enough even at small 
transverse momentum k^ . This feature is only used for the generation of fixed-order 
distributions. 

The generation cut is activated by setting the token bornktmin to the desired value in the 
powheg . input file. The Born suppression is activated by setting the token bornsuppfact 
to a positive real value. The process-specific subroutine born_suppression sets the sup- 
pression factor of eq. (2.3) to 

F(k T ) = -= ^ ? . (2.4) 

fcj + bornsuppfact 

If bornsuppfact is negative, the suppression factor is set to 1. 

The need of a transverse momentum cut is not only a technical issue. The NLO 
calculation of V + lj production holds only if the transverse momentum of the vector 
boson is not too small. In fact, as the k T decreases, large Sudakov logarithms arise in the 
NLO correction, and the value of the running coupling increases, up to the point where the 
cross section, at fixed order, becomes totally unreliable. These large logarithms should all 
be resummed in order to get a sensible answer in this region. In the POWHEG implementation 
of single vector-boson production [15], in fact, these logarithms are all resummed. Then 
it is clear that some sort of merging between the V + lj and the V production processes 
should be performed at relatively small transverse momentum, in order to properly deal 
with this problem. Here we will not attempt to perform such merging, that we leave for 
future work. We will simply recall, when looking at our results, that we expect to get 
unphysical distributions when the vector boson transverse momentum is too small. We 
will discuss this fact in a more quantitative way in section 3. 

2.2 Negative-weight events 

In the POWHEG method, negative-weight events can only arise if one is approaching a region 
where the NLO computation is no longer feasible. In our study for the V + lj process, 
we approach this region at small transverse momenta. In order to better understand what 
happens there, rather than neglecting negative weights (that is the default behaviour of the 
POWHEG BOX), we have introduced a new feature in the program, that allows one to track also 
the negative- weight events. This feature is activated by setting the token withnegweights 
to 1 (true). If withnegweights is set to 1, events with negative weight can thus appear 
in the Les Houches event file [16, 17]. While we normally set the IDWTUP flag in the Les 
Houches interface to 3, in this case we set it to -4. With this flag, the SMC is supposed to 
simply process the event, without taking any other action. Furthermore, the XWGTUP (Les 
Houches) common block variable is set by the POWHEG BOX to the sign of the event times 
the integral of the absolute value of the cross section, in such a way that its average equals 
the true total cross section. 
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We preferred not to use the option IDWTUP=-3 for signed events with constant absolute 
value. This option is advocated by the Les Houches interface precisely in such cases, and it 
requires that the event weight XWGTUP assumes the values ±1. However, the Les Houches 
interface does not provide a standard way to store the integral of the absolute value of the 
cross section, that would be needed to compute correctly the weight of the event in this 
case. In fact, the XSECUP variable is reserved for the true total cross section ctnlo- More 
specifically, suppose that the total NLO cross section receives contributions from regions 
in the phase space where the differential cross section is positive, and where it is negative, 
so that we can write 

CTNLO = f(+) — |f(-)| ■ (2-5) 

We then generate a sample of N = N + + iV_, with N + events with weight W{ = +1 (so 
that the sum of the Wi on events with positive weights is N+) and events with weight 
Wi = — 1 (so that the sum of the Wi on events with negative weights is — N-) in such a 
way that 

N + = ^ - N, iV_ = ^ - TV . (2.6) 

cr (+) + F(-)l a (+) + \ a (-)\ 

In order to give the correct NLO cross section, the N events should then be weighted with 
the sum of the positive plus the absolute value of the negative part of the cross section, in 
such a way that 

1 N 

xJ2 W i( a (+) + h-)D = a (+) - =(J NLO- (2-7) 

i=l 

Summarizing, since there is no standard way in the Les Houches interface to pass this 
absolute value when IDWTUP=-3, we set it to -4, and set the weight XWGTUP of the i-ih 
event to Wi x (c(+) + |<J(_)|), so that, event by event, we have this information. In this 
case the average value of the XWGTUP variable is equal to the total cross section, as required 
by the Les Houches interface when IDWTUP=-4. 

Notice that, if withnegweights is set to true and a Born suppression factor is also 
present, the events will have variable XWGTUP of either signs. In this case, XWGTUP is set 
to the sign of the event, times the absolute value of the cross section, divided by the 
suppression factor (the output of the born_suppression routine). Also in this case, the 
average value of XWGTUP coincides with the true total cross section. Weighted events are 
also useful if one wants to generate a homogeneous sample from relatively-low up to very- 
high transverse momenta. In this case, it is convenient to pick a very large bornsuppf act 
value, of the order of the maximum transverse momentum one is interested in. The large- 
momentum region will be more populated in this way. The form of the born_suppression 
function can also be changed at will by the user. 

3. Validation of the generated samples 

In this section we discuss the output of the POWHEG BOX for Z+lj production. We consider 
here pp collisions at 1.96 TeV. We use the CTEQ6M pdf set [18]. The factorization and 
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Figure 3: Z transverse momentum in samples U, W and at fixed NLO order. 



renormalization scales (in the computation of the B function) are fixed to the transverse 
momentum of the vector boson of the underlying Born configuration. For the generation of 
radiation, these scales are set by the PDWHEG BOX, according to the prescriptions given in [9]. 
We have first produced one sample with a generation cut k gen = 5 GeV on the transverse 
momentum of the Z boson in the underlying Born configuration. This sample, that we 
call sample U (for unweighted), was produced with positive weights. Then we produced a 
second sample W, where we used a Born suppression factor, with bornsuppf act = 10 GeV, 
and a generation cut of 1 GeV, in order to avoid unphysical values for the strong coupling 
and pdf's (we will assume, in the following, that this tiny generation cut has no effects 
on distributions where the Z boson or the hardest jet have transverse momenta of several 
GeV). We have also set withnegweights to 1, so that we are able to record negative 
weighted events that may arise in the region of very small transverse momenta. Sample W 
is thus weighted, with weights of either signs. We analyzed the events of the two samples 
at the POWHEG level, i.e. without feeding them to a shower Monte Carlo. We begin by 
comparing the transverse momentum distribution of the Z of the two samples, and of the 
fixed order NLO QCD result (obtained as a byproduct of the event generation) in fig. 3. The 
figure should be interpreted in the following way. The weighted sample gives the reference 
result, since it is unaffected by the generation cut. The U sample feels clearly the effect 
of the 5 GeV generation cut. However, for p%, > 10 GeV, the U and W results coincide, 
showing that the effect of the generation cut in this region is fully negligible. A second 
important observation has to do with results in the W sample. It becomes unphysical for 
Pt ~ 5 GeV. This indicates that NLO corrections in this region are out of control, and that 
our program should not be used for p^, < 10 GeV. The fixed order NLO result displays 
a similarly unphysical behaviour in the low transverse momentum region. The similarity 
is more apparent in the right panel of fig. 3, which is given in linear scale. There is a 
very large negative value of the NLO cross section in the bin between 1 and 1.5 GeV. The 
POWHEG result follows the NLO result up to a certain amount of smearing. 

Plots for the transverse momentum of the hardest jet are shown in fig. 4. They are 
similar to the plots in the Z transverse momentum, except for a more modest value of the 
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Figure 4: Transverse momentum of the hardest jet in samples U and W. 
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Figure 5: Rapidity distribution of the Z and of the hardest jet in samples U and W, with a p T 
cut on the transverse momentum of the Z/jet of 10 GeV 

NLO distribution at small transverse momenta. 

Finally, in fig. 5, the rapidity distribution of the Z and of the hardest jet are plotted 
for the U and W sample and for the NLO contribution. A cut of 10 GeV is applied on 
either the Z or the jet transverse momentum. We see, again, very good agreement between 
the U and W samples at this transverse momentum. We have also analyzed many other 
distributions for the U and W sample. They all lead to the conclusion that, when cuts on 
the transverse momenta of the order of 10 GeV (twice the generation cut) are applied, the 
U sample yields distributions that are equivalent to the W sample. More specifically, at 
10 GeV, in the differential cross section of the jet or of the Z transverse momentum, the 
U sample differs from the W sample by 6%, and the difference decreases with p T . Thus, 
we conclude that for p T > 10 GeV, at Tevatron energies, the U sample is substantially 
independent of a generation cut less or equal than 5 GeV. 

3.1 Negative weights and folding 

In order to get a reasonably small fraction of negative- weight events in the P0WHEG BOX, it is 
at times necessary to increase the folding parameters f oldcsi, f oldy and f oldphi. These 
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parameters were introduced in the framework of the POWHEG implementation of heavy- 
quark pair production [19], and are mentioned in the corresponding manual [20], and 
discussed extensively in ref. [9]. We briefly recall here their genesis and use. In POWHEG, 
the underlying Born configuration for an event is generated according to the B function, 
that, in the simplest cases, can be represented by the equation (see ref. [8]) 

B{<&b) = B{* B ) + V($ B ) + J d$ rad [R ($B, <*>rad) " C ($ B , $ rad )] . (3.1) 

In order to generate &b distributed according to this probability density, we actually 
parametrize the radiation variables in terms of three variables in the unit cube, JQ, and 
then perform the integral 

' d$ B d z X B($ B ,X), (3.2) 



where 



B(<Z> B ,X) =B(<S> B ) + V($ B ) + 



dX 



[i2(*B,*rad)-C(* B ,*rad)], (3-3) 



using the MINT integration program [21]. This program stores appropriate grids and upper 
bounds, so that, after the integration, it is possible to use it to generate points in the 
integration variables, i.e. in (<&b, X), distributed with a probability proportional to B. The 
X values are ignored, which amounts to integrating over them, and only is retained. 
This procedure is sufficient to generate positive weights if the process in question has a 
sufficiently-high scale, like in vector boson, Higgs or ti production. In these cases, a s 
(present in eq. (3.3) in the virtual V, real R and counterterms C contributions) is small 
enough so that B is always larger than the other terms, yielding a positive result for any 
value of &b and X. As the scale of the process decreases (and a s increases), it becomes 
more likely that B may become negative for some value of the X parameters, even if B is 
positive for any value of &b- Of course, this depends upon the way that the C counterterm 
is defined. A brute force remedy to this problem, is to fold the integration of the (R — C) 
term as many time as necessary to yield a positive B (that is to say, to get a negligible 
fraction of negative- weight events). The details of the folding method are illustrated in 
the MINT manual. Here we only illustrate it with an example. If we want to fold the X\ 
variable twice, we define 

#folded($B, Xi, X 2 , X 3 ) = B($ B , X!,X 2 ,X 3 ) + B{$ B , 1/2 + X!,X 2 ,X 3 ), (3.4) 

and integrate in xi from to 1/2. This clearly yields the same result as without the 
folding. Furthermore, if we increase the number of foldings for all the three parameters 
Xi, the folded function becomes less and less dependent upon the X{ values. With this 
technique, regions of integration where the function is positive and negative are combined 
together. It is clear that, if the B function is positive, with large enough folding numbers, 
we will achieve the positivity of the folded B function. 

The MINT integrator and the POWHEG BOX can perform the folding automatically. The 
user needs only to specify how many times each variables should be folded. The MINT 
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integrator divides each integration coordinates in 50 bins, such that each bin contributes 
equally to the integral of the absolute value of the B function. Folding is achieved by 
overlapping these intervals. Thus, the folding number must be a divisor of 50: 2, 5, 10, 25 
or 50. Three tokens in the powheg. input file can be set to the folding number: if oldcsi, 
if oldy and if oldphi, that refer to the folding of the three radiation variables £, y and (p. 

In fig. 6, we display the effect of folding on the amount of negative weights that 
enter the computation of a physical quantity, namely the Z transverse momentum (by 
choosing similar variables, like the transverse momentum of the hardest jet, we get similar 
results). All the histograms in the figure were obtained with the withnegweights flag 




25 50 75 100 125 150 



Pt (GeV) 

Figure 6: The effect of folding on the negative-weight event fraction. 

set to true. The solid line was obtained in the standard way, i.e. by adding positive and 
negative contributions. The dashed, dotted and dotdashed lines were obtained by plotting 
only the absolute value of the negative weights. The dashed line was obtained with no 
folding, the dotted line was obtained with a 2-5-1 folding (i.e. if oldcsi= 2, ifoldy= 5 
and ifoldphi= 1), and the dot-dashes line was obtained with the 5-10-5 folding. From the 
figure, we see that the amount of negative weights increases as the transverse momentum 
becomes smaller. This can be understood as being due to the increase of a s at small 
scales. By increasing the folding numbers, the number of negative weights is strongly 
reduced, and, furthermore, it only affects the region of very small transverse momenta, 
where the calculation becomes however unreliable. 

Increasing the folding number has a cost on the execution time. As a rule of thumb, we 
expect the execution time to increase as the product of the three folding numbers. Actually, 
the increase is somewhat less than that, because the Born and virtual contribution are 
evaluated only once for each set of foldings, since they do not depend upon the integration 
variables. Thus, the folding should be chosen as a function of the generation cut, and, 
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ultimately, as a function of the cutoff one is imposing on the hardest-jet transverse energy 
or on the Z transverse momentum. 



£-y-<f> folding ==>■ 


1-1-1 


2-5-2 


5-5-5 


5-10-5 


10-10-10 


LHC, 14 TeV, k gcn = 5 GeV 


0.25 


0.067 


0.026 


0.016 


0.0074 


LHC, 14 TeV, k gcn = 10 GeV 


0.17 


0.020 


0.0029 


0.00036 


3.4 x 10" 5 


Tevatron, 1.96 TeV, fc gen = 5 GeV 


0.23 


0.029 


0.0086 


0.0043 


0.0028 



Table 1: Negative-weight fractions <7(_)/(<7( + ) + |c(-)|) for given folding numbers. 

As an example, in table 1, we have collected the fraction of negative weights a^/(cr^ + 
for the LHC at 14 TeV, with two generation cuts, and for the Tevatron. As can be 
seen, the higher is the folding, the smaller is the fraction of negative-weight events. 

The typical performance of event generation is, without folding, roughly 1000 events 
per minute, on typical workstation cpu's. It is clear that, in the worse case of very low 
transverse-momentum jets, assuming that we can tolerate around 2% negative weights (that 
would populate, in all cases, the very small p T region), by using a 5-5-5 folding we end up 
generating of the order of eight events per minute. It is also clear that we have a compelling 
reason to generate independently samples with different generation cuts. Besides benefiting 
of a more abundant generation in the high p T region, we also benefit from the possibility 
of using smaller folding numbers. 

A clarifying consideration on negative weights in P0WHEG can be made in comparison 
with MC@NL0 [22]. The negative weights that we get in P0WHEG are fully analogous to the 
negative weights that one gets in the S events in MC0NL0. Also in that case, the number 
of negative weights could be reduced or eliminated by using a folding technique, as in our 
case. However, negative weights in the % event sample of MC@NL0 cannot be reduced, and 
so there is no reason to attempt to reduce the negative weight fraction in the S events. 

Finally, let us stress again that by performing any analysis on a P0WHEG event sample 
with signed events, or with weighted events, or with positive- weight events (using eventually 
a large enough folding number to get a negligible fraction of negative weights) yields the 
same result within statistics. This property can be easily verified by running the program 
with the same powheg . input files that differs only for the presence of the withnegweights 
flag set to 1, or for the presence of the bornsuppf act token, or, again, differing only in the 
folding numbers. In the current version of the POWHEG BOX we leave this choice to the user. 
The performance cost of getting positive weights may be well balanced if the events have to 
be run through costly detector simulators, or if it is undesirable to have negative weighted 
events in some complex, multivariate statistical analysis. The default POWHEG BOX setting 
is with no negative weights and no weighted events (i.e. F(k T ) = 1). 

3.2 Comparison with MCFM 

As a final check of the code, we have compared the NLO output of the POWHEG BOX to 
the output produced by the NLO code from MCFM [23], and total agreement was found. In 
spite of the fact that we have taken the virtual correction formulae (that we have checked 



- 12 - 



against the BlackHat [24] results too) from the MCFM program, this comparison is a highly 
non-trivial check, since the subtraction schemes used by the two programs are completely 
different. 

4. Phenomenology 

In this section we directly compare the output of our program to distributions that have 
been measured by the CDF and DO Collaborations at the Tevatron. The only aim of this 
study is to validate, to some extent, our program. We believe that a more thorough analysis 
can only be performed by the experimental collaborations themselves, using our event gen- 
erator. All the results displayed in this section have been obtained with a sample of roughly 
1.3 million events, at the center-of-mass energy of 1.96 TeV, in proton-antiproton collisions. 
The sample was generated with positive weights, with the folding parameters 5-10-5 (see 
section 3.1). The generation cut was set to 5 GeV. The fraction of negative- weight events 
generated with this setting of parameters was 0.4%. As shown in section 3.1, this fraction 
is concentrated in events with low transverse momenta. We have used the CTEQ6M pdf 
set, with the corresponding value of A QCD . The renormalization and factorization scales 
for the calculation of the B function are set equal to the Z transverse momentum in the 
underlying Born configuration. 

The events were showered using PYTHIA 6.4.21 [25]. We have compared two different 
choices of tune for PYTHIA: the tune A (PYTUNE(IOO) ), that uses the old shower and under- 
lying event model, and the Perugia tune (PYTUNE(320) ), that uses the new transverse- 
momentum ordered shower and the new underlying event model. Results for the Tune A 
model will be displayed as blue dashed histograms, while the Perugia will be shown as 
red solid lines. The data will be displayed as simple points with error bars. 

We have switched off photon radiation off leptons (mst j (41) =3), so that, in the case of 
Z —7- 1 + £~ , the lepton energy better represents what would be measured in an electromag- 
netic calorimeter. In addition we have set the Z mass and width to the values 91.188 GeV 
and 2.486 GeV respectively, sin 2 0§ = 0.2312 and a~^{M z ) = 128.930. 

We applied the jet algorithm to all particles in the event, including all leptons, except 
for those coming from Z decay. In other words, when comparing to experimental results, 
we assume that the jet energies are fully corrected to the particle level, including those 
particles that would not be visible in the detector. We have used the jet algorithms as 
implemented in the FAST JET package [26]. 

4.1 CDF results 

The CDF Collaboration provided and still provides results for Z/7 (— > e + e~) + lj and 
Z/7 (— > + lj events. In order to perform an analysis as similar as possible to 

the one done by the CDF Collaboration, we used the midpoint algorithm [27] to combine 
hadrons (from POWHEG events showered by PYTHIA) into jets, with cone radius R = 0.7 and 
a merging/splitting fraction of 0.75, starting from seed towers with transverse momenta 
above 1 GeV. 
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Z/7 (— > e+e ) + jets 

We begin by considering the Z/7 — > e + e~ results of CDF. In figures 7, 8 and 9 we compare 
the POWHEG results showered by PYTHIA using Tune A (blue dashed lines) and Perugia 
tuning (red solid lines), with the CDF data. 
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Figure 7: Total cross section for inclusive jet production. 
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Figure 8: The inclusive p T distributions for events with at least one and two jets. 

The results in figs. 7 and 8 were published in [1], while those in fig. 9 were extracted 
from the blessed results in [28], at an integrated luminosity of 2.5 fb _1 . 

In figs. 7 and 8 we plot results for the inclusive total cross section and inclusive p T 
distributions for the production of at least one and two jets. In fig. 9 we plot the p T 
distribution of the hardest and next-to-hardest jet and the inclusive rapidity distributions 
for events with at least one and two jets. 

In order to compare with CDF data, we adopted the following cuts 

66 GeV < M ee < 116 GeV, p% > 25 GeV, |r? ei | < 1.0, 1.2 < \rf*\ < 2.8, 
|y jct | < 2.1, ^ T ct > 30 GeV, Ai? e>jet > 0.7, (4.1) 
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Figure 9: p T distributions of the hardest and next-to-hardest jet and the inclusive rapidity distri- 
butions for events with at least one and two jets. 

where y and rj represent the rapidity and pseudorapidity of the specified particles, and 
where R is the distance in the azimuth-rapidity plane. 

We notice the good agreement between the P0WHEG prediction and the data. It parallels 
the agreement between data and the NLO MCFM result displayed in refs. [1, 28], despite the 
fact that, when more than two jets are considered, MCFM has NLO accuracy, while our 
generator is limited to leading order. However, we emphasize that the P0WHEG results are 
directly compared to data, while the MCFM ones are first corrected by parton-to-hadron 
correction factors, as detailed in [1]. Notice also the dependence of the results from the 
chosen tune of PYTHIA. The Perugia tune seems to give a slightly better agreement with 
data. We point out that the differences between the P0WHEG results and the data is of 
the same order of the differences between the two tunes, thus suggesting that, by directly 
tuning the P0WHEG results to data, one may get an even better agreement. 

Z/7 (->• H + H~) + jets 

Similar studies for the Z/7 decaying in the fj, + fi~ channel were also performed by CDF. 
In figs. 10 and 11 we display the total cross section for inclusive jet production and the 
inclusive p T and rapidity distributions for events with at least one and two jets. In order to 
perform an analysis as close as possible to the CDF experimental settings, we have applied 
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Figure 10: Total cross section for inclusive jet production. 
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Figure 11: The inclusive p T and rapidity distributions for events with at least one and two jets. 



the following cuts 

66 GeV < Mw < 116 GeV, > 25 GeV, |^| < 1.0, 

|yj ct |<2.1, ^ > 30 GeV, Ai?^ > 0.7 . (4.2) 

In this case, no clear conclusions can be drawn on which of the two chosen tunes better 
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reproduces the data. 



4.2 DO results 

The DO Collaboration performed analyses similar to those done by the CDF Collaboration, 
focusing on more exclusive jet cross sections, and considering also some angular distribu- 
tions. 

To perform an analysis as similar as possible to the one done by the DO Collaboration, 
we used the DO Run II iterative seed-based cone jet algorithm [29] in order to recombine 
hadrons into jets, with a splitting/merging fraction of 0.5 and a cone radius R = 0.5. 

Z/7 (-»■ H + H~) + jets 

In ref. [3, 5], several distributions were studied by the DO experiment using the set of cuts 



65 GeV < Mau < 115 GeV, V t > 15 GeV, \rf\ < 1.7, 



\V 



J ct l 



< 2.8, p J T ot > 20 GeV, AR^et > 0.5 



(4.3) 



In fig. 12, we plot distributions of the transverse momentum and rapidity of the hardest 
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Figure 12: Distributions of the transverse momentum and rapidity of the hardest jet and of the 
Z boson. 



jet and of the Z boson. In fig. 13, we show the azimuthal separation and rapidity separation 
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Figure 13: Azimuthal separation and rapidity separation between the Z boson and the leading 
jet, for events with p% > 25 GeV and p^ > 45 GeV. 
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Figure 14: Absolute value of the average rapidity of the Z and of the leading jet, |j/boost(^ + jet) j , 
for events with p§ > 25 GeV and p% > 45 GeV. 



between the Z boson and the leading jet, for events with > 25 GeV and p§ > 45 GeV, 
while in fig. 14 we plot the absolute value of the average rapidity of the Z and of the leading 
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jet, |yboost(^ -h jet) | , for events with p% > 25 GeV and > 45 GeV, where 

yboost(^ + jet) = ^(y z + y jct ) , (4.4) 

yj ct being the rapidity of the leading jet. 

In ref. [5], the DO Collaboration plotted the normalized differential cross sections, in 
order to reduce the associated error bars. We have, instead, preferred to plot the differential 
cross sections itself, in order to appreciate how well POWHEG reproduce not only the shapes 
but also the overall normalization. The total cross section they use to normalize is given 
in [3] and is equal to 

a z = 118 ± 0.5 (stat.) ± 4 (syst.) ± 4 (muon) ± 7 (lumi.) pb . (4.5) 

In addition, we have added, in quadrature, a flat systematic luminosity error of 8% to every 
bin. 

We see a noticeable discrepancy between data and the POWHEG prediction in the shape 
of the transverse-momentum spectra of the jet and of the Z. This discrepancy also manifest 
itself in differences in the normalization of the angular distributions in figs. 13 and 14, since 
transverse momentum cuts are applied there. We do not wish to comment further on this 
problem, that, on the other hand, is not present in the CDF analyses, and in the DO 
analysis of the Z/j — > e + e" channel. We remark that the problem may well be due to our 
failure to understand some features of the DO analysis, rather than to problems in the data 
or in POWHEG. We also notice that the angular correlation between the Z and the jet p T is 
only qualitatively described by POWHEG. We remind the reader, however, that POWHEG has 
only LO accuracy for this quantity, that is determined by the emission of a second hard 
parton not included in the hardest jet, and that near the back-to-back region, Sudakov 
resummation effects, as well as non perturbative effects, become determinant. 

Z/ 7 (-> e+e") + jets 

DO has published studies for the Z/7 (— >■ e + e~) + lj channel in ref. [4]. In fig. 15 we have 
plotted the p T distributions of the hardest, next-to-hardest and next-to- next-to-hardest 
jet. No information is given in the paper on the value of the total inclusive Z cross section 
used to normalized the distributions. It is reasonable to use the same total cross section of 
eq. (4.5). 

The cuts that we have applied for this analysis were the following 

65 GeV < M ee < 115 GeV, p% > 25 GeV, \rj e \ < 1.1 or 1.5 < \rj e \ < 2.5 , 

\y ict \ < 2.5 , p*T > 20 GeV . (4.6) 

We can see quite a good agreement among the data and the two POWHEG distributions for 
the hardest and next-to-hardest jet. Unexpectedly, there is quite a good agreement for the 
next-to-next-to-hardest jet too: in fact, this jet is a shower jet, so we do not expect it to 
be correct away from the collinear limit. 
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Figure 15: p T distributions of the hardest, next-to-hardest and next-to-next-to-hardest jet. 



4.3 LHC results 

In this section, we show a few results for the LHC at 14 TeV. We have produced 1.400.000 
events, generated with the withnegweights flag set to 1, with k gen = 5 GeV and with 
a 5-10-5 folding. With this folding, the fraction of negative weights is around 1.6%. We 
have then showered the events generated by PDWHEG with PYTHIA, using the tune A and 
the Perugia tuning. In figs. 16 and 17 we have plotted a few distributions obtained for 
the LHC, using the same cuts and the same jet algorithms used by the two Collaborations 
at the Tevatron: in fig. 16, we have applied the cuts of eq. (4.1) that CDF used for the 
Z/l e + e~) + lj analysis, while in fig. 17, we have applied the cuts of eq. (4.3), that DO 
used for the Z/^f (— > /i + /i~) + lj study. 

In fig. 16 we can see a pattern of the two tunes similar to the one depicted in the 
corresponding fig. 9: small differences for the distribution of the hardest jet, some differ- 
ences both at low and high p T for the next-to-hardest jet, and increasing differences in 
their inclusive rapidity distributions, with peaks of ~ 20%. Similarly, fig. 17 should be 
compared with the corresponding panels in figs. 12 and 13. Tune A gives a higher cross 
section at low transverse momentum and central rapidity of the vector boson with respect 
to the Perugia tuning, while they agree very well in the azimuthal separation and rapidity 
separation between the Z boson and the leading jet, for events with > 25 GeV. 
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Figure 16: p T distributions of the hardest and next-to-hardest jet and the inclusive rapidity 
distributions for events with at least one and two jets at the LHC at 14 TeV, with the cuts of 
eq. (4.1). 

5. Conclusions 

In this paper, we have presented, for the first time, a calculation of vector boson plus one 
jet NLO cross section interfaced to a Shower Monte Carlo program, within the POWHEG 
framework. More specifically, we have implemented this process using the POWHEG BOX, a 
general computer code framework for embedding NLO calculations in shower Monte Carlo 
programs according to the POWHEG method. 

We have compared the POWHEG results for Z/j + lj with the data available from the 
CDF and the DO Collaborations, using two different tunes for PYTHIA. We notice that 
differences between the POWHEG results and the data is of the same order of the differences 
between the two tunes, thus suggesting that, by directly tuning the POWHEG results to data, 
one may get an even better agreement. 

Finally, we have presented similar results for the LHC, running at 14 TeV, showing 
that, in some distributions, the difference between the two tunes is of the order of 20%. 

The code of our generator can be accessed in the POWHEG BOX svn repository: 
svn : //powhegbox . mib . inf n . it/trunk/POWHEG-BOX, with username anonymous and pass- 
word anonymous. 
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Figure 17: Distributions of the transverse momentum and rapidity of the Z boson and azimuthal 
separation and rapidity separation between the Z boson and the leading jet, for events with p^, > 
25 GeV, at the LHC at 14 TeV, with the cuts of eq. (4.3). 
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